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Abstract 

Ambisonics is a technique for the spatialization of 
sound sources within a circular or spherical loud¬ 
speaker arrangement. This paper presents a suite of 
Ambisonicsprocessors, compatible with most stan¬ 
dard DAW0 plug-in formats on Linux, Mac OS X 
and Windows. Some considerations about usabil¬ 
ity did result in features of the user interface and 
automation possibilities, not available in other sur¬ 
round panning plug-ins. The encoder plug-in may be 
connected to a central program for visualisation and 
remote control purposes, displaying the current posi¬ 
tion and audio level of every single track in the DAW. 
To enable monitoring of Ambisonics content without 
an extensive loudspeaker setup, binaural decoders 
for headphone playback have been implemented. 
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1 Introduction 

It turns out to be difficult finding platform inde¬ 
pendent audio plug-ins for encoding and decod¬ 
ing Ambisonics. Following section should give a 
brief overview of still maintained plug-ins. 

Fons Adriaensen’s AMB-plugin^j offer en¬ 
coders and rotators (yaw axis only) until 3 rd 
order. LADSPA can be used with the DAW Ar¬ 
dour under Linux and Mac OS X. There is no 
Windows host supporting LADSPA. For decod¬ 
ing Ambisonics signals into loudspeaker feeds, 
Adriaensen’s Jack client application AmbDec 
|Adriaensen, 2005 [ is often used. 

Bruce Wiggins offers his WigWar^] plug-ins 
in VST format for Windows and MacOS X. 
These processors include 2D and 3D encoders 
until 3 rd order as well as 1 st order decoders for 
several fixed loudspeaker arrangements. 

Daniel Courville’^] Audio Unit (Mac OS X 

1 Digital Audio Workstation 
2 http://kokkinizita.linuxaudio.org 
3 http://www.brucewiggins.co.uk 
4 http://www.radio.uqam.ca/ambisonic/ 


only) plug-in suite offers 3D encoders for 1 st and 
2 nd order as well as 2D encoders for 5 th order. 

For the plug-ins created in this work, 
the C+ + cross-p latform programming library 
JUCE0 |Storer, 2012jj has been used to develop 
audio plug-ins compatible to most DAWs on 
Linux, Mac OS X and Windows. JUCE is be¬ 
ing developed and maintained by Julian Storer, 
who used it as the base of the DAW Track- 
tion. JUCE is released under the GNU Public 
Licence. A commercial license may be acquired 
for closed source projects. It is possible to build 
JUCE audio processors as LADSPA, VST, AU, 
RTAS and A AX plug-ins or as Jack standalone 
applications. LV2 (LADSPA version 2) support 
currently has to be added manuall y fro m the 
separate project DISTRHC0 |Coelho and Ro¬ 


drigues, 2012 


Ambisonics suffers from different existing 
standards concerning channel order and normal¬ 
ization. To overcome this problem, a conver¬ 
sion tool is included in the plug-in suite. En¬ 
coders, rotators and decoders from the authors 
suite are designed for the ACN channel order 
and SN3D norm alization, proposed by |Nach- 


bar et ah, 2011]. Conversion between standards 


of the input and/or output format can be done 
by the conversion plug-in. 

Apart from platform compatibility, some con¬ 
siderations about usability did result in features 
of the user interface, not available in other sur¬ 
round panning plug-ins. Continuously rotat¬ 
ing a sound source results in a discontinuity of 
the angular representation between —180° and 
+180°. This jump is also reflected when draw¬ 
ing automation curves resulting in a mismatch 
between the visual representation and auditory 
perceived movement of a sound source. A solu¬ 
tion allowing to define absolute starting points 
and angular velocities for relative movements is 


5 http://www.rawmaterialsoftware.com 
e http://distrho.sourceforge.net 



















proposed. 

For headphone monitoring several binaural 
decoders have been implemented simulating the 
Ambisonics half sphere of the medium sized 
IEM Cube with 24 speakers and the concert 
hall MumuthQ with 29 speakers in a elliptical 
stretched half sphere. 

For visualization and external control pur¬ 
poses, a bidirectional Open Sound Control 
(OSC) communication layer has been imple¬ 
mented in the encoder plug-in. An external pro¬ 
gram is able to display the current position and 
audio level of every track in the DAW. The visu¬ 
alization program may also take control over the 
sources. This can be very useful in performance 
situations while having a multitrack playback 
coming from the DAW and a central display to 
control the position of the individual tracks. 

Currently no audio plug-in format can handle 
dynamic input/output channel counts. There¬ 
fore all plug-ins may be compiled for fixed Am¬ 
bisonics orders. 
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Figure 1: Ambisonics production and playback 
chain 

2 Encoder 

The encoder plug-in distributes a mono source 
signal into Ambisonics channels (derived from 
spherical harmonic functions), according to az¬ 
imuth and elevation settings, representing the 
coordinates on a spherical surface. An addi¬ 
tional parameter called size [0,1] may be used 
to adjust the spatial directivity. Adjusting the 
size parameter from zero towards one results in 
a scaling of the higher order components. For 
a size setting of 1, all Ambisonics channels will 
be zero except the 0 th order (also known as W 
channel), resulting in an equally distributed sig¬ 
nal over all loudspeakers. 


2.1 Automation parameters 

Most current DAWs are limited to represent au¬ 
tomation parameters between 0.0 and 1.0 along 
a time line. Panning plug-ins usually map this 
range for azimuth and elevation between —180° 
and +180°. A full circle rotation results in 
a mismatch between the visual representation 
(Fig. § of the automation curve and the per¬ 
ceived continuous rotation of the source. Ad¬ 
ditionally the plug-in host may interpolate be¬ 
tween a jump from 1.0 to 0.0, resulting in a very 
fast audible jump. 



Figure 2: Automation curve for full circle rota¬ 
tion using Ardour and AMB plug-ins 

To overcome this problem, automation pa¬ 
rameters (Fig. [3]) have been added for setting 
start points (SetAzimuth, SetAzimuthRel) and 
angular velocities (tgl-rot-azimuth). The maxi¬ 
mum speed of the angular velocity may be ad¬ 
justed between 0 and 360 deg/sec by an addi¬ 
tional parameter (max-speed) . This guarantees 
a wide range of adjustment and at the same time 
accuracy for the rotation speed. 



Figure 3: Encoder automation parameters 

2.2 Remote control and visualization 


‘ http://www.kug.ac.at/en/ 
studies-further-education/studies/ 
infrastructure/the-mumuth.html 


Keeping track of all sound source positions 
within a DAW may be a difficult task for the 
mixing engineer. To allow a better overview 






















































and control of the spatial scenery, a coopera¬ 
tive visualization and control unit has been im¬ 
plemented. All encoder plug-ins are equipped 
with a bidirectional OSC layer (Fig. [4]), send¬ 
ing and receiving control and status parameters. 
Currently a functional prototype has been im¬ 
plemented in Pd/GEM (Fig. [5]) displaying all 
tracks (encoders) on a sphere, including visual¬ 
ization of their audio levels. Currently the audio 
level is represented by the variable length of the 
cylinder representing a source signal. This con¬ 
cept may be extended to a more sophisticated 
implementation. 


individual OSC/UDP connection from visualization to encoders 



Figure 4: Encoder OSC communication with re¬ 
mote visualization and control 
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Figure 5: Visualization and remote control of 
several encoders with Pd/GEM 

3 Rotator 

The rotation plug-in (Fig. [6]) may be used to 
manipulate the orientation in the Ambisonics 
domain, as described in |Musil et ah, 2003]. 
An optimized way to calculate rotation matri¬ 
ces can be found in |Rumori, 20091. Therefore 
yaw, pitch and roll can be adjusted by ±180°. 
This is very useful for the incorporation of head 


movements during binaural playback. The ro¬ 
tation plug-in listens to an adjustable UDP port 
for incoming OSC messages. This allows to by¬ 
pass host automation and controlling the rota¬ 
tion directly from the head tracking software 
(Sec. glj. 
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Figure 6: Ambisonics Rotator 

4 Binaural decoder 

The binaural decoder computes virtual loud¬ 
speaker feeds as a linear combination of the Am¬ 
bisonics signals according to a given decoding 
matrix. These virtual loudspeaker signals are 
convolved with their individual stereo impulse 
responses, modeling the transfer path from the 
loudspeaker position to the left and right ear of 
the listener (Fig. [l]). 

4.1 Head tracking 

Head tracking is a significant feature for virtual 
reality scenes and headphone playback. Small 
head movements change the relative position of 
a sound source in aspect to the listeners ears, 
making localization more easy and removing 
ambiguity. 

The Kinect™ sensor as add-on for the gam¬ 
ing console XBox 360™ by Microsoft offers a 
low budget depth sensor. |Fanelli et ah, 2011 
developed a software to gather head orienta¬ 
tion angles and the head position relative to the 
Kinect sensor (Fig. [7]). The author extended 
this head pose estimation software about send¬ 
ing OSC data to the Ambisonics rotator plug- 
irEJ The Ambisonics sound field has to be ro¬ 
tated in opposite direction to keep the sound 
source positions fixed and suppress the rotation 
with the listeners head. 


8 http://github.com/kronihias/ 
head-pose-estimation 
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Figure 7: Head pose estimation 


4.2 Matrices and virtual venues 

To allow monitoring in various virtual scenes, 
different decoding matrices and sets of 
loudspeaker-to-head impulse responses were 
used. 

4.2.1 Reduced decoder (3rd order 3D) 

This binaural decoder uses specific symmetry 
relations between the HRTFs and decoder ma¬ 
trices to achieve a reduced set of head releated 
impulse responses, directly applicable on the 
Ambisonics Signals withou t com puting virtual 
loudspeaker signals|Musil et ah, 2007 . This ap¬ 
proach reduces the numbers of convolutions and 
therefore the CPU load. The algorithm is im¬ 
plemented in the iem_bin_ambi library for Pure 
data. 


4.2.2 IEM Cube and Mumuth decoder 
(4th and 5th order 3D) 

The 120m 2 IEM Cube (Fig. ^ serves as the 
main lab of the Institute of Electronic Music 
and Acoustics Graz. 24 Tannoy coaxial speak¬ 
ers are mounted in a hemispherical arrangement 
consisting of three rings (12 - 8 - 4). During the 
years of Ambisonics research at the IEM, sev¬ 
eral different approaches for finding an optimal 


decoder matrix have been taken Zotter et ah, 


2012; Sontacchi, 2003 . The IEM Cube binaural 


decoder implemented in the authors Ambison¬ 
ics plug-in allows to switch between the different 
available decoder matrices. 

The Mumuth (Fig. [9]) was opened in 2009 
as multi purpose venue for the University of 
Music and Performing Arts Graz. It houses 
the 600m 2 Gyorgy Ligeti concert hall. The 33 
Kling&Freitag CA 1001 SP loudspeakers (29 are 
used for the half sphere, 4 more speakers are lo¬ 
cated in the corners) may be arranged by a spe¬ 
cial motor controlled mounting, resulting in a 


versatile loudspeaker setup that can be changed 
within a minute. The Mumuth binaural decoder 
uses the Ambisonics decoder matrix and loud¬ 
speaker settin g for the IEM Demosuite |Plessas 
and Zmolnig, 2012 by Thomas Musil. 

The impulse responses of the IEM Cube 
and the Mumuth were recorded by Martin Ru- 
mori and David Pirro as part of the artistic 
research project The Choreography of Sound 
(CoS) |Eckefet al., 2012| . 



Figure 8: IEM Cube 



Figure 9: Mumuth 


5 Ambisonics converter 

Since the beginning of Ambisonics in the 1970s 
and the extension to Higher Order Ambisonics, 
several different approaches for arranging and 
normalizing the spherical harmonic components 
have been taken. A good summary may be 
found in [Nachbar et al., 20TT and [Chapman et 
al., 2009| . All plug-ins in this suite are operating 
with the ACN channel order and SN3D normal¬ 
ization, proposed in |Nachbar et al., 20111. The 
Ambisonics converter plug-in allows to inter¬ 
change the normalization schemes SN3D, N3D, 
Furse-Malham and the channel order schemes 
ACN, SID and Furse-Malham. Thus, it is pos¬ 
sible to incorporate various standards into the 
production chain. 
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Figure 10: Converter plug-in 

6 Summary 

This paper presented an Ambisonics proces¬ 
sor suite, usable in production or live perfor¬ 
mance. The JUCE framework proved to be a 
stable solution for developing cross platform au¬ 
dio plug-ins and standalone applications. By 
the time of writing this paper, no Mixed-Order 
systems have been implemented. Mixed-Order 
Ambisonics uses different orders for the horizon¬ 
tal and the vertical part of the sound field. It 
is planned to incorporate this into the converter 
plug-in. 
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